Latent semantic indexing and its applications in chinese text processing 隱含語義索引及其在中文文本處理中的應(yīng)用研究
5 hull d a . improving text retrieval for the routing problem using latent semantic indexing . in proc 即在句子中頻繁同時出現(xiàn)的詞,并將其作為文檔的特征詞組。
Research on semantic indexing system and semantic retrieval model for chinese medical information based on multilayer conceptual semantic network structure 基于多層次概念語義網(wǎng)絡(luò)結(jié)構(gòu)的中文醫(yī)學(xué)信息語義標(biāo)引體系和語義檢索模型研究
This paper proposes a method, called tckm ( task-circumstance-based km ), which adopts the structural description oriented to task circumstance as the semantic indexes for the suitability of information body content and the query requirement of work practice, hence can solve effectively this problem 本文提出的基于任務(wù)情景的知識管理方法tckm,以面向任務(wù)情景的結(jié)構(gòu)化描述作為信息體內(nèi)容適用性和工作實踐查詢需求的語義索引,可以有效地解決這一挑戰(zhàn)性問題。
Under such background, this paper proposes a method, called tckm ( task-circumstance-based km ), which adopts the structural description oriented to task circumstance as the semantic indexes for the suitability of information body content and the query requirement of work practice, hence can solve effectively this problem 在此背景下,本文提出的基于任務(wù)情景的知識管理方法tckm,以面向任務(wù)情景的結(jié)構(gòu)化描述作為信息體內(nèi)容適用性和工作實踐查詢需求的語義索引,可以有效地解決這一挑戰(zhàn)性問題。
This paper researches and discusses the theory of latent semantic index, include the theory of single value decompose and word-document matrix . in this paper the author discusses the application of latent semantic index in chinese document clustering based on latent semantic index, researches and discusses vector space model, latent semantic index, electronic dictionary, word-splitting and the algorithm of k-means . this paper presents a improved structure of electronic dictionary and a improved algorithm of word-spliting 本文對潛在語義索引模型進(jìn)行系統(tǒng)的研究和探討,包括奇異值分解等相關(guān)矩陣?yán)碚?、詞-文檔矩陣等;同時本文研究和探討了潛在語義索引模型在中文文本聚類中的具體應(yīng)用和實現(xiàn),包括文本間相似度的度量、詞-文檔矩陣、奇異值分解的具體實現(xiàn);同時本文對中文文本聚類所涉及的其他一些中文處理技術(shù),包括向量空間模型、電子字典、切詞、k-means聚類算法等也進(jìn)行了研究和探討。
This paper researches and discusses the theory of latent semantic index, include the theory of single value decompose and word-document matrix . in this paper the author discusses the application of latent semantic index in chinese document clustering based on latent semantic index, researches and discusses vector space model, latent semantic index, electronic dictionary, word-splitting and the algorithm of k-means . this paper presents a improved structure of electronic dictionary and a improved algorithm of word-spliting 本文對潛在語義索引模型進(jìn)行系統(tǒng)的研究和探討,包括奇異值分解等相關(guān)矩陣?yán)碚摗⒃~-文檔矩陣等;同時本文研究和探討了潛在語義索引模型在中文文本聚類中的具體應(yīng)用和實現(xiàn),包括文本間相似度的度量、詞-文檔矩陣、奇異值分解的具體實現(xiàn);同時本文對中文文本聚類所涉及的其他一些中文處理技術(shù),包括向量空間模型、電子字典、切詞、k-means聚類算法等也進(jìn)行了研究和探討。
This paper researches and discusses the theory of latent semantic index, include the theory of single value decompose and word-document matrix . in this paper the author discusses the application of latent semantic index in chinese document clustering based on latent semantic index, researches and discusses vector space model, latent semantic index, electronic dictionary, word-splitting and the algorithm of k-means . this paper presents a improved structure of electronic dictionary and a improved algorithm of word-spliting 本文對潛在語義索引模型進(jìn)行系統(tǒng)的研究和探討,包括奇異值分解等相關(guān)矩陣?yán)碚摗⒃~-文檔矩陣等;同時本文研究和探討了潛在語義索引模型在中文文本聚類中的具體應(yīng)用和實現(xiàn),包括文本間相似度的度量、詞-文檔矩陣、奇異值分解的具體實現(xiàn);同時本文對中文文本聚類所涉及的其他一些中文處理技術(shù),包括向量空間模型、電子字典、切詞、k-means聚類算法等也進(jìn)行了研究和探討。
This paper researches and discusses the theory of latent semantic index, include the theory of single value decompose and word-document matrix . in this paper the author discusses the application of latent semantic index in chinese document clustering based on latent semantic index, researches and discusses vector space model, latent semantic index, electronic dictionary, word-splitting and the algorithm of k-means . this paper presents a improved structure of electronic dictionary and a improved algorithm of word-spliting 本文對潛在語義索引模型進(jìn)行系統(tǒng)的研究和探討,包括奇異值分解等相關(guān)矩陣?yán)碚?、詞-文檔矩陣等;同時本文研究和探討了潛在語義索引模型在中文文本聚類中的具體應(yīng)用和實現(xiàn),包括文本間相似度的度量、詞-文檔矩陣、奇異值分解的具體實現(xiàn);同時本文對中文文本聚類所涉及的其他一些中文處理技術(shù),包括向量空間模型、電子字典、切詞、k-means聚類算法等也進(jìn)行了研究和探討。